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(54)Tltk: MODIFYING PROTCIN IMMUNCX^ICITY 
(57)Abflncl 



h hknown diat proteins, or parts of them, may be rendered ooo- or less immunogenic to ^umans or other species by Identifying 
one or more potcmia) T cell epitopes and climinaiing them by amino acid modification. Conventionally, certain epitopes may be retained 
to a protein sequence if die peptides constinuing such epitopes are pitscnt in endogenous human protein, since d»y would be recognised 
as W. However, u has now been found that even self epitopes may give rise to inunune leaetions. The inveoiioo prwidei for their 
elimination, for example by recombinant DNA technology, to tender them mom ludiil for admintstiiiion to humans, for example for 
therapeutic or diagnostic puiposes. 
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MODIFYING PROTEIN IMMUNOGENICITY 

THE PRESENT INVENTION relates to proteins to be administered especiaUy to 
humans particularly for therapeutic use but also for use in diagnostic tests. The 
5 invention particularly provides for proteins which are modified to be less 
immunogenic than the unmodified counterpart when used in vivo. . 

The invention particulaily addresses the clinical need for a process by which the 
natural tendency of the host to mount an immune response against an administered 
10 protein is substantially reduced or eliminated. There are several examples where the 
administration of protein molecules offers therapeutic benefit. However the benefit is 
greatly reduced particularly where multiple doses of the therapeutic protein are 
required as the recipient immune system recognises and accelerates the elimination of 
the incoming therapeutic protein. 

15 

There are a number of therapeutic proteins whose therapeutic use is curtailed on 
account of their immunogenicity in man. For example, when murine antibodies are 
administered to patients who are not immunosuppressed, a majority of patients mount 
an immune reaction to the foreign material by making human antt-^nuiine antibodies 
20 (HAMA). There are two serious consequences. First, the patient's anti-murine 
antibody may bind and clear the therapeutic antibody or immunoconjugate before it 
has a chance to bind to (he tumour and perform its fonction. Second, the patient may 
develop an allergic sensitivity to the murine antibody and be at risk of anaphylactic 
shock upon any fiiture exposure to murine immunoglobulin. 

25 

Several techniques have been employed to address the HAMA problem and thus 
enable the use in humans of therapeutic monoclonal antibodies (see, for example WO- 
A^909622, EP-A-02394G0, EP-A^)438310, W(>A-91 09967). A common aspect of 
these methodologies has been the introduction into the therapeutic antibody, which in 
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general is of rodent origin, of significant tracts of sequence identical to that present in 
human antibody proteins. Such alterations are also usually coiq^led to alteration of 
particular single amino add residues at positions considered critical to maintaining the 
anuTwdy-antigcn binding interaction. For antibodies, this process is possible owing to 
5 the very high degree of simctural (and functional) conservatism between anU'body 
molecules of difTercnt species. However for potentially therapeutic proteins where no 
structural homologue may exist in the host species (e.g. human) for the therapeutic 
protein, such processes are not applicable. Furthermore, these methods have assumed 
that the general introduction of human sequence will render the re-modelled antibody 
non-immunogenic. It is known however, that certain short peptide sequences ("T cell 
epitopes'*) can be released during the degradation of proteins within cells and 
subsequently presented by molecules of the major histocompatability complex (MHC) 
in order to trigger the activation of T cells. For pq)tidcs presented by MHC class II, 
such activation of T cells can then give rise to an antibody response by direct 
1 5 stimulation of B cells to produce such antibodies. None of the previous mrthods have 
addressed the elimination or avoidance of such epitopes in the final therapeutic 
molecule as a means to reduce or eliminate the antibody response against proteins. 
Nor have previous methods considered the elimination of peptides presented by MHC 
class 1 which can trigger a cytotoxic T cell response leading to cell killing. Such 
killing of cells causes the release of cellular components including proteins which can, 
in turn, activate specialist cells which are highly active in protein processing and MHC 
presentation and also in release of inflammatory cytokines as a result of such 
activation. As a result, an inflammatoiy environment can be created which promotes 
the more active uptake and processing of the protein for therapeutic use thus 
25 facilitating induction of an antibody response against the protem. 

The elimination of T cell epitopes from proteins has been previously disclosed (see 
WO-A-9852976) wher*y such potential T cell epitopes arc defined as any peptide 
within the protein sequence with the abDity to bmd to MHC class U molecules. Such 
30 potential epitopes are measured by any computational or physical method to establish 
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MHC binding. Implicit in the tcnn 'T cell q)itope" is an epitope which is recognised 
by the T cell receptor and which can at least in principle cause the activation of these 
T cells. It is however usually understood that certain pq)tides whidi are found to bind 
to MHC class n molecules may be retained in a protein sequence because such 
5 peptides are recognised as ''self' within the oiganism into which the final protein is 
administered. 

In practice, soluble proteins introduced into autologous organisms do sometimes 
trigger an immune response resulting in development of host antibodies which bind to 
10 the soluble protein. One example is mterferon alpha 2 to which a percentage of human 
patients make anUbodics despite the fact that this protein is produced endogenously. 

The present invention is based on the discovery by the inventors that MHC-binding 
peptides within autologous proteins can trigger immune re^onses in live organisms to 
those proteins even when the specific protein is endogenously produced. A possible 
explanation for this is that the doses of such proteins administeied are much higher 
than normal providing MHC-peptide formation which can activate T cells. 
Alternatively, the physiological environment into which such administered proteins 
are presented is one favourable to efficient antigen processing and peptide-MHC 
fonnation at higher levels than usually encountered, for example in inflammatory 
situations. 

According to a first aspect of the invention, there is provided a method of rendering a 
protein, or part of a protein, non-inununogenxc, or less inmnmogenic, to a given 
25 species, the method comprising: 

(a) determining at least part of the amino acid sequence of the protein; 

(b) identifyii^ in the amino acid sequence one or more potential epitopes for T 
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cells ("T cell epitopes*) which are found in an endogenous protein of the given 
species; and 

(c) modifying the amino acid sequence to eliminate ai least one of the T cell 
5 epitopes identified in step (b) thereby to reduce the inununogcnicity of the protein or 
part thereof when exposed to the immune system of the given species. 

The given species will usually be human. In step (c), one or more actual epitopes or 
potential epitopes may be eliminated. 

10 

Therefore, the present invention provides a modified method for creating proteins 
which trigger a reduced or absent immune response whereby one or more MHC 
binding peptides which are also found in the autologous organism's endogenous 
proteins are modified to reduce or eliminate binding to MHC molecules. Thus, in 

IS practice, no assumption is made about which peptides might be tolerated by the 
oiganism (unless data is available which indicates that tolerance is in place). Overall, 
the invention encompasses the creation of protein molecules with reduced or absent 
immunogenicity m live organisms where one or more peptides which bind to MHC 
molecules are eliminated &om the protein molecule. As such, the invention is an 

20 improvement on the prior art which focuses only on the removal of potential T cell 
epitopes from proteins with the assumption that the organism is tolerant to self 
proteins or peptide sequences. In particular, the invention includes a range of human 
and non-human proteins which are modified by modification of one or more MHC 
binding peptides within the sequences. Such MHC binding peptides can include 

25 peptides which hind to MHC class H molecules and/or pqjtides which bind to MHC 
class I. 

The a^ect of the invention described above is preferred to the alternative strategy of 
separately immunising the organism with such peptides in order to induce tolerance or 
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thc altcniative of administering analogues or fragments of the protein which resist 
binding or uptake into cells (especially antigen-presenting cells), for example by 
elimination of the cell binding site in the protein. Tolerising the organism to the 
peptide in this way ( eg, the use of MHC binding pq)tides identified fiom a syng^idc 
5 protein which can be used to induce tolerance in the host prior to admim'stration of the 
entire protein) would require two therapeutic molecules, one to perforai the main 
protein function and the other to induce the tolerance, which would not be ^voured 
from the regulatory point of view. 

10 Any method of identifying - in which term is included predicting - one or more 
potential T cell epitopes can be used in the invention. Acceptable methods may be 
computational or physical and include measuring of the binding of MHC-pq)tide 
complexes to T cell receptors, the testing of potential T cell epitopes in transgenic 
mice expressing human MHC molecules, the testing of potential T cell epitopes in 

!5 mice reconstiUited with human antigen-presenting cells and T cells in place of their 
endogenous cells, and the testing of potential T cell epitopes for stimulation of T cells 
in vitro via presentation of peptides on MHC molecules using syngeneic antigen- 
presenting cells. 

20 As well as identifying in the amino acid sequence one or more potemial T cell 
epitopes which are found in an endogenous protein of the given species, a method of 
the invention may, and typically will, involve additionally identifying and eliminating 
one or more potential T cell ephopes which arc not found in an endogenous protdn of 
the given species, since such epitopes are even more likely to be immunogenic. 

25 

The present invention is based on a new concq)t in development of therapeutic 
proteins. It has been recognised previously that, with exogenous T cell dependent 
proteins, helper T cell epitopes are important to developmem of a significant immune 
re^onse to the autok^ous protein. Such T cell epitopes function thiongh the internal 
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processing of the protein releasing peptides which complex with MHC class n 
molecules and which are then presented on the surface of impropriate cells for 
potential binding to receptors on T cells. However, it is very difficult to predict which 
peptides within the protein will be processed appropriately such that MHC binding can 
5 occur and it is also a factor that many processed peptides will not bind to all allotypic 
variants of MHC class U molecules (MHC restriction). Furthennore, in any given 
living oiganism, it is diificult to predict whether a T cell response will actually be 
triggered by a given peptide presented on MHC class n because T cells may have been 
tolerised to such epitopes or may not have the repertoire of T cell receptors to bind to 

10 the MHC class U-pcptide complex. Due to these complicating factors, it has 
previously been impractical to develop proteins with reduced or absent 
immunogenicity because of the difficulty in predicting acmal T cell epitopes. The 
invention herein primarily takes the new approach of creating improved therapeutic 
proteins fay removal of potential rather than acmal T cell q>itopes (usually defined by 

15 binding to MHC molecules) such that certain peptides within the molecule which are 
not actually inununogenic may be altered in addition to those immunogenic peptides. 
For a therapeutic molecule, preferably all of the potential T cell epitopes are removed 
whilst retaining activity of the protein. Preferably, this involves judicious choice of 
the amino acid substitutions enabling removal of the T cell epitopes and usually will 

20 involve the testing of a range of variant molecules with different amino acid 
substitutions. 

Not all identified potcmial T cell epitopes which are found in an endogenous protein 
need be eliminated. For example, immune responses in general are not mounted to 

25 autologous circulating proteins, such as immunoglobulins and other serum proteins 
such as serum albumin. So potential T cell epitopes which are found for example in 
gexmline immunoglobulin variable region protein sequences of the given species 
(ulkich will generally be human) may sometimes be ignored. However, an epitope 
considered as not generally available to the immune system to gain tolerance may be 

30 identified as a potential epitope for elimination according to the method of the present 
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inventioiL Examples of such include ^itopes include those of intracellular proteins, 
such as nuclear proteins and integral membrane proteins. 

The invention therefore provides proteins which have been altered to reduce their 
5 inununogenicity as well as a general method of altering such proteins to reduce their 
immunogenicity. A major principle of the present invention is that proteins are altered 
by identification of potential T cell epitopes and their subsequent alteration within the 
proteins in order to eliminate such potential epitopes. Optionally, epitopes recognised 
by host antibodies fB cell epitopes") can also be removed if these can be identified 
10 for example with the aid of patient antisera or if the surface residues in a non- 
autologous protein can be altered to those of a related autologous protein endogenous 
to the host organism without losing all of the desired activity of the non-autologous 
molecule. 

15 After elimination of one or more T cell epitopes, the protein may be tested for desired 
activity, it may retain its full activity, or it may retain a sufficient proportion of its 
original activity to be usefiil. In some circumstances, the activity may be altered either 
beneficially or at least in an acceptable way. Proteins entirely lacking in useful 
function post modification may be discarded. 

20 

Proteins of the present invention include those which have potential clinical use in 
humans or veterinary use in animals, especially where such uses involve multiple 
administrations of the proteins. They may be non-autologous or autologous. Proteins 
of the present invention include proteins with an enzymatic activity which has a 

25 beneficial therapeutic efTect, proteins which arc used to convert inactive drugs to 
active drugs within living organisms, proteins which are used to vaccinate whereby 
certain immunogenic epitopes aie undesirable, proteins which perfonn as cairien of 
other molecules within the living organism and proteins which bind to other molecules 
within or mtroduccd within the living organism in order to alter the biodistribution of 

30 the other molecules. Amongst the examples of non-autologous proteins with potential 
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benefit for in vivo human use are the following: Thrombolytic proteins streptokinase 
and staphylokinase; pro-drug activating enzymes such as asparaginase and 
caiboxypeptidase G2; toxins of plant, fungal and bacterial origin such as ricin, 
saponin, pokcwecd antiviral protein, mistletoe lectin, cholera toxin, pertussis toxin, 
5 diphtheria toxin; bacterial phospholipase C (eg. PE33, PE35, PE37, PE38. PE40); 
ribotoxins such as alpha-saicin, clavin, mitogiUin, restrictodn and bryodin-1 and 2. 
Other significant oon-autologous molecules of proven and potential therapeutic benefit 
include streptavidin, cobra venom factor, insulin, collagen, non-autologous antibodies, 
non-autologous MHC molecules and non-autologous T cell receptor molecules. 

10 Autologous proteins of proven and potential ther^eutic value include interferon 
a and p. granulocyte macrophage colony stimulating factor (GMCSF), granulocyte 
colony sUmulating factor (GCSF). tissue plasminogen activator (t-PA), insulin. Factor 
VIII, Factor IX, erythropoietin (EPO). pituitary growth horaionc and megakaryocyte 
growth and development factor (MGDF) and derivatives of these proteins produced by 

15 recombinant methods. Other examples of proteins include recombinant molecules 
derived or comprised of single or multiple domains from autologous or non- 
autologous proteins, recombinant proteins engineered to change or modify fimction or 
protem sequences derived fmm any of the above molecules. 

20 A typical protocol within the general method of the present invention comprises the 
following steps: 

L Detennining the amino acid sequence of the protein or a part thereof (if 



U. Identifying potential T cell epitopes within the amino acid sequence of the 
protein by any method including determination of the binding of pq)tides to 

MHC molecules, detemiination of the binding of pcptidc:MHC complexes to 



modification only of a part is required); 



25 



30 



the T cell receptors from the species to receive the therapeutic protein, testing 
of the protein or peptide parts thereof using transgenic animals with the MHC 
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molecules of the species to receive the therapeutic protein, or testing with 
transgenic animals reconstituted with immune system cells from (he species to 
receive the therapeutic protein; 

By genetic engineering or other methods for producing modified proteins, 
altering the protein to remove one or more of the potential T cell epitopes and 
producing sudi an altered protein for testing; 

IV. (optionally) Within stq) Dtt., altering the protein to remove one or more of the 
1 0 potential B cell epitopes; 

V. Testing altered proteins with one or more potential T cell epitopes (and 
optionally B cell epitopes) removed in order to identify a modified protcm 
which has retained all or pan of its desired activity but which has lost one or 

1 S more T cell epitopes. 

Potential T-ccU epitopes herein are defined as specific peptide sequences which cither 
bind with reasonable efficiency to MHC class II molecules (or their equivalent in a 
non-human species), or which in the forai of peptide:MHC complexes bind strongly to 

20 the T cell receptors from the species to receive the therapeutic protein or which, from 
previous or other studies, show the ability to stimulate T-cells via presentation on 
MHC class n. The method of the present mvention recognises that an effective T cell- 
dependant immune refuse to a fineign protem requires activation of the cellular aim 
of the immune system. Such a response requires the uptake of the therapeutic 

25 (foreign) protein by antigen presenting cells (APCs), Once inside such cells, the 
protein is processed and fragments of the protein form a complex with MHC class II 
molecules and are presented at the cell surface. Should such a complex be recognised 
by binding of the T cell receptor from T-ceils, such cells can be, under certain 
conditions, activated to produce atunuhitory cytokines. The cytokines will elicit 

30 difierentiation of B-cells to fiill antibody producing cells. In addition, such T cell 
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responses may also mediate other deleterious effects on the patient such as 
inflammation and possible allergic reaction. 

It is understood that not ail peptide sequences will be delivered into the correct MHC 
5 class n cellular compartment for MHC class n binding or will be suitably released 
form a larger cellular protein for subsequent MHC class 11 binding. It is fiiilher 
understood that even such peptides which are presented by MHC class II on the 
surface of APCs may not elicit a T-ccU response for reasons including a lack of the 
appropriate T-cell specificity, tolerance by the immune system to the particular peptide 
1 0 sequence or the low affinity of the MHC-peptide complex for T cell receptor. 

The present invention provides for removal of human (or other given species) potential 
T-cdl epitopes from the thwapeutic protein whereby the primary sequence of the 
therapeutic protein can be analysed for the presence of MHC class II binding motifs by 

15 any suitable means. For example, a comparison may be made with databases of 
MHC-binding motifs such as, by searching the **motifs" database at the world-wide 
web site httD:/farehih.wehi.edu.au/mhcpep/ (foimeriy wehil.wehi.edu.au or 
wehi.wehi.edu.auV Alternatively, MHC class II binding peptides may be identified 
using computational threading methods such as those devised by Altuvia et al. ((/ 

20 Mol. Biol 249 244-250 (1995))). Similar methods may be used to identify and 
remove epitopes which bind to MHC class I. 

Having identified potential given species (e.g. human) T-cell epitopes, these epitopes 
are then e lim inated by alteration of one or more amino acids, as required to eliminate 

25 the T-cell epitope. Usually this will mvolve alteration of one or more amino acids 
within the T-cell epitope itself This could involve altering an amino acid adjacent to 
the q)itope in terms of the primaiy stiucnue of the protein or one which is not adjacent 
in the primaiy structure but is adjacent in the secondary structure of the molecule, liit 
usual alteration contemplated will be amino acid substitution, but in certain 

30 circumstances amino acid deletion or addition will be t^propriate. In some instances, 
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selection of the appropriate amino acid substitution at any particular point in the 
therapeutic protein primaiy structure, can be made with reference to an homologous 
protem primaiy structure. This is particularly the case where a repertoire of 
homologous genes exist, as fbr example witii immunoglobulins, or a single 
5 homologous gene for the therapeutic protein exists, as is the case for the cobra venom 
factor protein and human complement factor C3 detailed in Example 1. However in 
other cases, &ere may not be an homologous protein to the therapeutic protein as is the 
case for potential therapeutics such as the thrombolytic agent staphylokinase. In such 
cases amino acids may be selected on the basis of similar size and/or charge, or more 

10 preferably and where stmctural data for the protein exist in combination with or with 
reference to in silico protein modelling techniques. In some instances amino acid 
substitutions may be made to prevent proteolytic cleavage of a protein and thereby 
inhibit binding of a peptide to MHC class n. Examples of protease cleavage sites 
include those reported by van Noort & van der Drift (J. Biol. Chan. 264 14159-14164 

15 (1989)) and van Noort et al {Eur. / ImmunoL 21 1989-1996 (1991)). Examples of 
amino acids which prevent protease digestion have been identified e.g. Kropshofer et 
al.,J. Immunol. 151 4732-4742 (1993). 

In the method of the present invention^ usually a number of variants of altered proteins 
20 will be produced and tested for retention of the desired activity. Where it is desirable 
to maximise the removal of potential T cell q>itopes from the protein, it will be 
common to create a range of variants including some with potential T cell epitopes 
remaining in the molecule. It will be recognised that with certain protein molecules, it 
is difficult to radically alter the molecule and to retain full activity and so judicious use 
25 of molecular modelling of variants and the testing of a maximum number of variants is 
desirable. For molecular modelling, standard commeroially available software 
packages can be used to model the protein structure using as starting point either of a 
ciystal structure or a model built from homology or protein folding prediction. 
Infonnation on parts of the protein involved in imparting the molecule with its activity 
30 will assist in modelling variants enabling best choice of altered amino acids which 
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remove a potential T cell epitope (or optional B cell q>itope) whilst retaining activity. 

In the mediod of the present invention where a number of variants of afteied proteins 

will be produced and tested for retention of the desired activity, it is desirable to have 
a high-throughput method for screening large numbers of variants. Such methods 
include in vivo methods for expression of gene variants in cdls such as bacteria with 
rapid isolation of the proteins for example using an antibody to a conserved region of • 
the proteins, or via a protein purification tag included in the protein sequence. As an 
alternative, in vitro methods such as in vitro transcription and translation can be used 
in some cases. Where the protein binds to another molecule, display methods such as 
phage display and ribosome display can be used in order to select out variants which 
retain binding activity. 

In the practice of the present invention, specific amino acid alterations can be 
conducted by recombinant DNA technology, so that the anal molecule may be 
prepared by expression from a recombinant host using estabUshed methods. However, 
the use of protein chemistry or any other means of molecular alteration is not excluded 
in the practice of the invention. Whilst the present invention provides a method for 
removal of potential T cell epitopes (and optional B ceU epitopes) from protein 
molecules, the method does not exclude testing of variant molecules produced within 
the invention for actual T cell epitope activity including testing of the protein or 
peptide parts thereof using transgenic animals with the MHC molecules of the spedts 
to receive the ther^cutic protein, or testing with transgenic animals reconstituted with 
immune system cells from the species to receive the thenq)eutic pioteia 



Whilst such methods are not yet routine, in vUro T cell assays can be undertaken 
whereby the protein can be processed and presented on MHC class II by appropriate 
antigen^resenting cells (APCs) to syngeneic T cells. T cell leqxNtses can be 
measured by simple proliferation measurements (especUlly if the APCs have been 
irradiated or otherwise treated to prevent proliferation) or by measuring qiecific 



25 
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cytokiiie release. In otda to accounl for dificrent MHC class U allotypes, a 
range of in vivo assays will osuaUy be nquired m oidcr to broadly test fbr T-cdl 
epitopes. Alternatively) traosgenic asiroals eqmpped with tuxmxtn (or the desired 
spedes) MHC class n molecules could be used to test for T-cell epitopes especially 
S where &e host MHC oUss O repertoire has been removed and espedally where one or 
more other host aeocssoiy molecalcs in APCH'-odl interaction have also been rqplaced 
with buman (or fliedesirod species) such as CD4 on Thetis. 

In a further aspect, the invention provides molecules resulting fiom method as 
described above. Sudi molccales may be usefiil for treating or preventing a disease or 
10 condition. The invention also extends to the use of such molecules in in vrw tad in 
vitro diagnosis and fbr human or veterinary use. Prcfened features of each aspect of the 
invention are as for each other aspect, mutatis mutandis. 

Hie present invention provides a method of rcmlerinfi an endogenous 
intTBcelhilar protein, or part of a protein, noiioimi&uaogenlG; or less immunogcmc, to a 
16 givea species, the mdhodconqirising: 

(a) detenniniog at least part of the amino add scqfueoce of said endogenous 
intracellular protein; 

(b) identifying in the amino add sequence one or more potential qnUypes for 
T cells {**T cell qiitopcs^ which are found in said endogenous intracellular protein; and 

20 (c) modifyice die amino acid sequence to eliminate at least one of the T edl 

epitopes identified in step (b) thereby to reduce the immunogenidty of &e protein or 
part thereof when exposed to (hfi immune system of the grvcn spedes. 

Aay i ^^f^g"^ of docuscnts. acts, materials, devices, aitides or die like whidi 
has been induded In &e present apedficaiion is solely ibr the pun>ose of providittg a 

25 context for the present iavoition. It is not to be taken as an ad m issi o n chat any or all of 
these mattes form part of the pzicr an base or wen common gencEBi Icnowledge in ibe 
field relevant to the present invention as it existed before the priority date of eadi daim 
of this application. 

Throu^sout this spedfioadon die word "comprise", or variations sndi as 
30 'comprises" or "oon^iiidng", will be understood to in^ly the indusion of a stated 
dement, integer or stq), or groi^ ofdementSpintegBEEorsteps^butnottheexdusionof 
any other demen^ int^er or step, or group of dements, integers or steps. 

Tbt invouion is illustra^ but not limited by the following eacainples. The 
examples xttfor to the following figures: 
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FIGURE 1 shows the protein sequence of mature cobra venom factor, 

FIGURE 2 shows the protein sequence of an altered cobra venom factor-chain; 

FIGURE 3 shows the protein sequence of an altered cobra venom &ctor-chain; 

FIGURE 4 shows the protein sequence of an altered cobra venom factor-diain; 

FIGURE 5 shows the protein sequence of streptokinase fix»m Streptococcus 
equisimilis: 
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FIGURE 6 shows the protein sequence of an altered streptokinase molecule; 
FIGURE 7 shows the protein sequence of mature st^hyloldnase; 

5 

FIGURE 8 shows the protein sequence of an altered staphylokinasc molecule; 

FIGURE 9 shows the protein sequence of wild-type biyodin 1 ; 

10 FIGURE 10 shows the protein sequence of an altered bryodin 1 molecule; 

FIGURE 1 1 shows the protein sequence of mature human interferon 2 ; and 

FIGURE 12 shows the protein sequence of an altered human interferon 2 
IS molecule. 

Example I 

In the present example the cobra venom factor (CVF) protein is analysed for the 
presence of potential MHC binding motifis and a method disclosed for the removal of a 
20 number of these from the molecule. 

CVF is the non-toxic complement-activating glycoprotein in cobra venom. The 
mature CVF protein consists of three polypeptide chains a, p and y, produced by post- 
tianslational processing of a pre-pro protein of approximate molecular wd^t ISOkDa 
25 (Fritzingcr DC. e/ ai, Proc. Natt Acad. ScL USA 91 12775-12779 (1994)), The 
a and p chains are both linked to the y chain via single disulphide bridges. In addition, 
both the a chain and the y chain have single intra chain disulphide bonds whilst the p 
chain has a complex arrangement of six fimhcr intemal disulphide bonds. The manire 
protein has four potential sites for N-glycosylation of which only three are used 
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(Vogel. CW & MuUcr-Ebcihard J, Immunol 18 125-133 (1984)). 

In the preferred embodiment of the present invention, production of the reduced 

immunogenicity or aliercd VOTioD of the therapeutic protein is by rxxombinant DNA 
technology. For the present example^ it is understood that the expedient of 
recombinant production of the mature CVF protein alone is likely to provide a 
considerable reduction in immunogenicity of the protein. This assertion stems fiom 
the knowledge that a degree of the immunogenicity arising fiom administration of 
CVF purified from Naja naja venora» arises fiom antibody responses to the particular 
pattern of glycosylation not encountered in mammalian proteins (Gowda. D.C et al., J. 
Immunol 152 2977-2986 (1994)). It is also understood that deglycosylated and also 
sialylated CVF that would arise from recombinant production of CVF. for example in 
a mammalian producer cell system, have equal functional activity with native mature 
CVF (CJowda. D.C et al (1994) ibid,). 



In humans CVF triggers the altemaiivc (properdin) pathway of complement activation. 
CVF forms a bimolecular complex with factor B which is cleaved by factor D into Bb 
which remains bound to CVF to form the C3/C5 convcrtase CVF^b. This complex is 
functionally and stnicturally analogous to the human C3/C5 convcrtase formed during 
the activation of the alternative pathway. The clinical importance of CVF lies in the 
high relative stability of the CVF,Bb complex versus the native C3/C5convertase [tyn 
of 7 hours versus txn of 90 seconds) and in particular, the inability of CVF.Bb to be 
disrapted and hence down regulated by factors H and 1 (Gowda, D.C et ai, (1994) 
ibid). The net resuh of CVF administration therefore is to lead to the depletion of 
complement in human serum by continuous eflicient and uncontrolled complemem 
activation. 

This property has been exploited as a research tool for in vivo and in vUro do- 
complementation (Cochrane CG et al., J. Immunol. 105 55 (1970)) and has also been 
exploited for the selective elimination of nimour cells as a conjugate with a nmiour 
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targeting monoclonal antibody (Vogcl, C.-W. & Muller-Eberhard HJ. Proc. Nat'l 
Acad. Sei. USA 78 7707 (1981)). Availability of a non-immiinogcnic CVF would have 
considerable importance as a potential thenq>eutic sigent for the depletion of 
complement in the plasma of patients undergoing organ xenotransplantation or in the 
5 construction of second generation antibody conjugates for targeted activation of 
complement in cancer patients. 

Method for identincatioo of potential MHC class n binding rootffi in cobra 
Venom Factor: 

10 The sequence of CVF was identified finom die GenBank database. The sequence with 
accession number UD9969 was used throughout. The protein sequences of the mature 
a, P and y chains were identified (Figure 1) and analysed individually for the presence 
of potential MHC class 11 binding motifi. Analysis was conducted by computer aided 
comparison to a database of MHC-binding moti£s as resident on world wide web site 

15 httprZ/wehihwehiedufln/mhepftp/ 

Results of the "seardiing** process on the a chain, indicate the presence of 638 
potential MHC class 11 binding motifs. Of these. 525 matched sequences identified in 
a database of human geimhne immunoglobulin variable region protein sequences. 

20 These epitopes were not considered further on the basis that immune responses in 
general are not mounted to autologous circulating proteins such as immunoglobulins. 
This implies immunological tolerance to the potential T-cell epitopes present in the 
structure of the immunoglobulins (and indeed the majority of human proteins). 
Epitopes presented by non-autologous proteins such as CVF which are identical or 

25 similar to motife present in immunoglobulin proteins are likely also to be tolerated and 
in practice may be retained through the de-immunisation process. 

Following subtraction of the human immunoglobulin protein gemiline motift, the 
remaining 113 potential q)itopes in the a chain, were analysed individually for 
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Similarity to non-iimnunoglobulm protein sequences. In practice, predicted anchor 
residues for each potential epitope were used in a consensus sequence search of human 
expressed proteins. The SwissProt and GenBank translated sequence databases were 
interrogated using commercially available software (DNAstar Madison, WI, USA). 
5 Epitopes identified in known circulating human proteins were not consid^ed further 
and were therefore allowed to remain unchanged within the CVF a chain. An 
example of one such rejected potential epitope is given by the sequence DCPGMPY at 
positions 332038 in the CVF a chain (numbering fiom first amino acid of the mature 
a chain). This sequence represents a predicted consensus binding motif for HLA- 

10 DRQBl»0301 with anchor residues underlined. Database searching using the 
consensus sequence FxxxMxY identifies only a single entry in a human protein sub- 
set of the SwissProt database coiresponding to human serum biotinidasc (SwissProt 
accession U A54362). An example of an epitope where no match to a human protein 
considered to be in the general circulation was found is provided by sequence 

15 lYQVGNNELat position 501-509 in the CVF o chain. This sequence ttprcsenls a 
potential epitope for presentation by HLA-DRB1*0101. Consensus sequence 
searching identifies only four human proteins containing this motif, of which three are 
nuclear proteins of differentiated tissues such as brain, and the fourth is an integral 
membrane protein. These may be considered as not generally available to the immune 

20 system to gain tolerance and therefore identify this as a potential epitope for 
elimination according to the method of the present invention. Similarly, a further 
potential HLA-DRl binding motif was identified in the a chain. Peptide sequence 
XVWQVTG2 at positions 81-89. This motif identifies a single human nuclear 
protein DKA polymerase alpha (GenBank accession # M64481), in the same data set 

25 and was also therefore identified for modification by the method of the present 
invention. Following these processes, a total of 15 potential epitopes in the a chain 
were considered for removal by amino acid substitution. 

Results of the -searching** process on the p chain, indicate the presence of 366 
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potential MHC class n binding motiiis. Of these, 281 matched sequences identified in 
a database of human geraUine immunoglobulin variable region protein sequences and 
were not considered further. Following a second round of subtractions using a 
database of human senim protein sequences (as above), 18 potential epitopes were 
S considered for removal 

Results of the ^'searching** process on the y chain, indicate the presence of 267 
potential MHC class fl binding motifs. Of these, 219 matched sequences identified in 
a database of human gennlinc immunoglobulin variable region protein sequences and 
0 were not considered further. Following a second round of subtractions using a data 
base of human serum protein sequences (as abovcX 9 potential q)itopes were 
considered for removal. 

The net result of these processes was to identify those residues within the CVF 
5 molecule which should be altered to eliminate potential MHC class 11 binding motifs. 
Individual amino acids within the predicted binding motifs were selected for 
alteration. With the object of maximising the likelihood of maintaining protein 
functional activity, in all cases conservative amino acid substitutions were chosen at 
any given site. In many cases, individual amino acid or even short strings (<5 
0 consecutive residues) were subsUtuted by reference to the published sequence of 
human complement factor 3 (C3) (GenBank accession # K02765) which shows areas 
of strong homology, including regions of identity, with CVF. In two instances, 
potential epitopes were eliminated from the P chain by insertion of additional amino 
acids. Amino acids were selected with reference to die human 03 protein. 



Altered CVF a chain, p chain and y chain sequences were compiled (Figures 2 to 4) 
and fiirther analysed by database comparison, as previously, for confirmation of 
successful elimination of potentia] MHC class II binding motifs. 
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Method for constnictioD of altered CVF molecales: 

POl primers CVFlfor, 5*ATAAGAATGCGGCCGCATGGAGAGGATGGCTCTCT 
and CVFlrev; 5'ATAAGAATGCGGCCGCTATCATOATTCTTCTGAAC were 
used to ampliiy a 4985bp firagmcot from a Xgtl 1 cobra venom gland cDNA libraiy 

5 (FritzingCT DC et al., ibid). The PGR was conducted on a total Ubrary DNA 
preparation using high fidelity polymerase mix optimised for long distance PGR 
{Advantage PCR Kit, Clontech. Basingstoke, UK) and conditions suggested by the 
supplier. The PGR product was cloned into pcDNA3.l Onvitrogcn. Leek. The 
Netherlands) as a Not\ restriction fragment using standard techniques (Sambrook J., 

0 Fritsdi E.F. & Maniatis T. (eds.) in: Molecular Cloning a Uboratoiy Manual, Cold 
Spring Haibor Laboratory Press. NY, USA (1989)). The gene sequence was 
confinned to be identical to database entries using commercially available reagent 
systems and instmctions provided by the supplier (Amersham. Little Chalfont. UK). 
Site directed mutagenesis was conducted using synthetic oligonucleotides and die 

5 "quick-change" procedure and reagents from Stratagene (Cambridge. UK). - Mutated 
(de-immunised) versions of the gene were confinned by sequencing. Mutated CVF 
genes were tiansfected into CHO cells by electroporation. Stable transfectants were 
selected usiqg 041 8. and clones secreting active CV? were selected using a C 
consumption assay (Gowda D.C et al. (1994) ibid; Cochrane C.G. et al (1970) ibid) 

0 Expressing clones were expanded and recombinant protein was purified from the 
culture si^ematant using sequential column chromatography and methods as 
described (Vogel C-W et al., J, Immunological Methods 73 203-220 (1984)). 

Example 2 

5 The present mvention details a process whereby potentially immunogenic q)itopes 
within a non-auto!ogous protein may be identified and offas methodology whereby 
such q)itopes may be eliminated. It is understood that there arc a number of proven 
therapeutic proteins for which their therapeodc use is curtailed on account of their 
immunogenicity in man. In the present example the therapeutic protein streptokinase 
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is analysed for the presence of potential MHC binding motiis and a method disclosed 
for the removal of a number of these from the molecule. 

Streptokinase (SK) is a single chain protein of approximate molecular wdght 47kDa 

5 that is produced by certain strains of p-haemolytic streptococci (Huang T.T. et aL, 
Mol Biol. 2 197-205 (1989)). The protein has no inherent enzymatic activity but has 
considerable clinical importance owing to its ability to efficiently bind human 
plasminogen, potentiating its activation to plasmin and thereby promoting the 
dissolution of fibrin filaments in blood clots. Several studies have shown that SK is an 

10 effective thrombolytic agent in the treatment of coronary thrombosis, improving 
survival (ISIS-2 CoUaborative Group. Lancet 2 349-360 (1988)) and preserving left 
ventricular fimction following myocardial infarction (ISAM Snidy Group, M Engl. J. 
Med, 314 1465.1471 (1986); Kennedy J.W, et aL, Circulation 77 345-352 (1988)). 
Despite the undoubted therapeutic value of SK, the non-autologous origin of the 

15 protein is disadvantageous due to its immunogenicity in humans. The production of 
neutralisiiig antibodies in the patient in generally limits the protein to a single use. 

Method for identmcatioD of potential MHC class n binding motiis in 
streptokinase: 

20 The sequence of streptokinase was identified from the GenBank database. Hie 
sequence with accession number S46536 was used throughout (Figure S). Hie 
sequence was analysed for the presence of potential MHC class II binding moti& by 
computer aided comparison to a database of MHC-binding motifi as residem on world 
wide web site http://wehih.wehi.edu.au/mhcpep/ . 



Results of the "seardiing" process indicate the presence of 395 potential MHC class U 
binding motifs. Of these, 283 matched sequences identified in a database of human 
germline hnmunoglobulin variable region protein sequences. These epitopes were not 
considemd fiirther on the basts that inunune responses in general are not mounted to 
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autologous circulating proteins such as immunoglobulins. Tliis implies 
inmiunological tolerance to the potential T^II epitopes present in the structure of the 
immunoglobulins (and indeed the majority of human proteins). Epitopes presented by 
non-autologous proteins such as SK which are identical or similar to motifis present in 
5 immunoglobulin proteins are likely also to be tolerated and in practice may be retained 
through the de-immunisation process. 

Following subtraction of the human inununoglobuHn protein gcnnlinc motifs, the 
remaining 1 12 potential epitopes were analysed individually for similarity to non- 
10 immunoglobulin protein sequences. In practice, predicted anchor residues for each 
potential epitope were used in a consensus sequence search of human expressed 
proteins. The SwissProt and GenBank translated sequence databases were 
interrogated using commercially available software (DNAstar Madison, Wl. USA). 
Bpitopes identified in known circulating human proteins were not considered further 
15 and were therefore allowed to remain unchanged within the SK molecule. An 
example of one such rejected potential ^itope is given by the sequence jLLKAIQEQL 
at positions 79-87 in the SK protein. This sequence represents a predicted consensus 
binding motif for HLA-DR1*0101 with anchor residues underlined. Database 
searching using the consensus sequence LxxAxxxxL identifies >4000 entries in a 
20 human protein sub-set of the SwissProt database, including scrum albumin protein 
(SwissProt accession number P02768). An example of an epitope where no match to a 
human protein considered to be in the general circulation was found is provided by 
sequence rVDYNHLat position 299-305 in the SK protein. This sequence represents 
a potential epitope for presentation by HLA-DR4*0401. Consensus sequence 
25 searching identifies < 50 human proteins containing this motii; of which many are 
intracellular proteins of differentiated tissues such as brain. These may be considered 
as not generally available to the immune system to gain tolerance and therefore 
tdeotily this as a potential epitope for elimination according to the method of the 
present invention. Similarly, a further potential HLA-DRl*010l binding motif was 
30 identified in the SK peptide sequence KADLLKAI at positions 76-83 of the SK 
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protein. This motif identifies < 150 human proteins in the same data set and was also 
identified for modification by the method of the present invention. 

The net result of these processes was to identify those residues vrithin the SK molecule 
5 which should be altered to eliminate potential MHC class Q binding motifs. 
Individual amino acids within the predicted binding moti& were selected for 
alteration. With the object of maximising the likelihood of maintaining protein 
functional activity, in all cases conservative amino acid substitutions were chosen at 
any given site. A new (de-immunised) SK sequence was compiled (Figure 6) and 
1 0 further analysed by database comparison, as previously, for confumation of successful 
elimination of potential MHC class n binding motifs. 

Metbod for constroctioo of de-immunised SK molecules: 

PCR primers SKI (5*-ggaattcatgattgctggacctgagtggctg) and SK2 (5*- 

15 tggalccttatttgtcgttagggtatc) were used to amplify the wild-type SK gene from a strain 
of Streptococcus equisimililis group C (ATCC accession number 9542). The resulting 
I233bp fragment was cloned into pUCI9 as a BtttnSl'EcdS^ restriction fragment 
using standard techniques (Sambrook J., Fritisch E.F. & Maniatis T. (eds.) in: 
Molecular Ctoning a Laboratory Manual, Cold Spring Harbor Laboratory Press, NY, 

20 USA (1989)). The gene sequence was confiraied to be identical to database entries 
using commercially available reagent systems and instructions provided by the 
supplier (Amersham, Little Chalfont, UK). Site-directed mutagenesis was conducted 
using synthetic oligonucleotides and the "quick<hange^ procedure and reagents from 
Stratagene UK Ltd. Mutated (de-immunised) veRions of the gene were confirmed by 

25 sequencing. Mutated SK genes were sub^loned as EcoV^-BamVl fragments into the 
bacterial expression vector pEKG-3 (Estrada M.P. et ai^ Bio/Technology 10 1 138- 
1 142 (1992)) for expression of de-immunised SK. Recombinant protein was purified 
using a plasminogen afHnity column according to the method of Rodriguez ei at. 
(Rodriguez P. et c/., Biotechniques 7 638-641 (1992)). Fibrinolytic activity was 

30 assessed using the casein/plasminogen plate technique and the ui vitro clot lysis assay 
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as described by Estrada et al (Estrada et al., ibid.). 
Example 3 

In Ac present example staphylokinasc is analysed for the presence of potential MHC 
5 binding motife and a method disclosed for the removal of a number of these fiom the 
molecule. 

St^hylokinasc protein finom Staphylococcus aureus has recognised and well 
characterised profibrinolytic properties. Recombinant forms of the protein have been 

10 previously produced and in vitro and in vivo stitdies have indicated that the protein 
holds considerable promise for thrombolytic therapy (Sako, T., Eur. J. BiochenL 149 
557-563 (1985); Schlott, B. et al.. Biotechnology 12 185-189 (1994); CoUen, D. et al.. 
Circulation 94 197-462 (1996)). However, clinical use in humans has been limited 
due to the demonstrated immunogenicity of the protein in man (Collen, D. ei al., 

15 Circulation 95 463-472 (1997)). Availability of a non-iramunogcnic staphylokinase 
would have considerable importance as a potential agent for thrombolytic therapy. 

The manire staphylokinase protein consists of a single polypeptide chain of 137 amino 
acids with approximate molecular weight 15.4kDa (Silence K. et al.^ J Biol 
20 Chemistry, 270 27192-27198 (1995)). 

Method for identification of potential MHC class U binding motib io 
staphylokinase: 

The sequence of st^hylokinase (sakSTAR) protem as given in table 1 of CoUen et al. 
25 (CoUen, D « a/. (1996) ibid.) was used throughout The protein sequences of the 
mature staphylokinase (Figure 7) were analysed for the presence of potential MHC 
class H biiuling motifi. Analysis was conducted by computer aided comparison to a 
database of MHC-binding moti& as resident on worid wide web site 

http'y/wghjh.wyiH.^H,^p/ntf>gppp/. 
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Results of the **searching" process indicate the presence of 128 potential MHC class 11 
binding motifs. Of thesc» 91 notched sequences identified in a database of human 
gennline immunoglobulin variable region protein sequences. These epitopes were not 
considered further. Epitopes presented by non-autologous proteins such as 
staphylokinase which are identical or similar to motifs present in immunoglobulin 
proteins are likely (o be tolerated and in practice may be retained through the de- 
immunisation process. 

Following subtraction of the human immunoglobulin protein gcnnlinc motifs, the 
remaining 37 potential epitopes were analysed individually for similarity to non- 
immunoglobulin protein sequences. In practice, predicted anchor residues for each 
potential epitope were used in a consensus sequence search of human expressed 
proteins. The SwissProt and GenBank translated sequence databases were 
interrogated using commercially available software (DNAstar Madison, WI, USA). 
Epitopes identified in known circulating human proteins were not consid»ed funher 
and were therefore allowed to remain unchanged within the protein. 

The net result of these processes was to identify those residues within the 
staphylokinase molecule which should be altered to eliminate potential MHC class n 
bindiiig motifs. Individual amino acids within the predicted binding motifs were 
selected for aheration. With the object of maximising the likelihood of maintaining 
protein functional activity, in all cases conservative amino acid substitutions were 
chosen at any given site. 

An altered staphylokinase sequence was compiled (Figure 8) and further analysed by 
database comparison, as previously, for confirmation of successfiil elimination of 
potential MHC class n binding moti&. 

Method for construction of altered sUpbylokinase molecules: 

A wild-type staphylokinase gene was synthcsised under contract by Geoosys 
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Biotechnologies Ltd (Cambridge. UK). The gene was constructed by PCR using long 
(80-mer) ovcriapping synthetic primcis and the sequence as given by Collen. D et ai 
(CoUen, D. et cd., (1996) ibid,) The synthetic gene was cloned as a 453 bp fcoRI- 
Hin^ restriction fragmCTt into bacterial expression vector pMEX (MoBiTcc. 
5 Gottingen, Germany). The gene sequence was confinncd to be identical to daUbase 
entries using commodally available reagent systems and instructions provided by the 
supplier (Amersham, Uttie Chalfom, UK). Altered (reduced inununogenicity) 
versions of the gene were engineered using site directed mutagenesis of the wild-type 
gene in pMEX. Short (18-mer) synthetic oligonucleotides and the "quick-change" 
10 procedure and reagents from Stratagene (Cambridge, UK) were used to create the 
variant genes. All variant gene sequences were confinned by DNA sequencing. 

Mutated stapbylokinase genes were transformed into £. coli strain TGI by standard 
techniques. A single transformed clone was selected and clones secreting active 
15 staphylokinase were selected using a fibrin plate assay (Astrup. T. et al.. Arch. 
Biochem, Biophys. 40: 346-351 (1952); Collen. D. et ai, Fibrinoiysis ^: 203-213 
(1992)). The best expressing clone was grown up and recombinant protein was 
purified from the culture supernatant using sequential cohmm chromatography and 
methods as described previously (CoUen, D. et ai, (1992) ibid,; Schlott et ai,, ibid.). 

20 

Example 4 

In the present example bryodin 1 is analysed for the presence of potential MHC 
binding motifs and a method disclosed for the removal of a number of these fiom the 
molecule. 

25 

The gene for bryodin 1 protein has recently been cloned fiiom Bryonia dionia, a 
member of the Cucurbitaceae femily of plants (Gawlak, S. et al.. Biochemistry 36 
3095-3103 (1997)). Biyodin 1 is a type I ribosomc inactivating proteia Studies using 
recombinant fonns of the protein have indicated that bryodin 1 holds considerable 
30 promise for immunotoxin therapy for cancer and other diseases (Gawlak, S. et al., 
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(1997) ibid.). However, clinical use in humans as with other immimotoxin agents, is 
likely to be curtailed due to immunogenicity of the protein in man. Availability of a 
non-inununogenic biyodio would have considerable importance as a potential 
component in immunotoxin based therapies. 

5 

The mature biyodin protein consists of a single polypeptide chain of 267 amino acids 
with approximate molecular weight 29kDa. The wild-type sequence is illustrated in 
Figure 9. 

10 Mefbod for identificatioD of potential MHC class n binding motifs fn bryodin 1: 

The sequence of bryodin I protein as given by Gawlak e( al, (Gawlak, S. et ai, (1997) 
ibid.) was used throughout. The protein sequence of the mature bryodin I (Figure 9) 
was analysed for the presence of potential MHC class n binding motifs. Analysis was 
conducted by computer aided comparison to a database of MHC-binding motifs as 
15 resident on world wide web site httO!//wehih.wehLedu,aiiAnhcpqi/. 

Results of the "searching- process indicate the presence of 315 potential MHC class U 
binding motife. Of these, 259 matched sequences identified in a database of human 
geraaline immunoglobulm variable region protein sequences. These epitopes were not 
20 considered iiirther. 

Following subtraction of the human immunoglobulin protein gcrmline motifs, the 
remaining 56 potential epitopes were analysed individually for similarity to non- 
immunoglobulin protein sequences. The predicted anchor residues for each potential 
25 epitope were used in a consensus sequence search of human expressed proteins. The 
SwissProt and GenBank translated sequence databases were interrogated using 
commercially available software (DNAstar Madison, WI, USA). Epitopes identified 
in known circulating human proteins were not considered further and were therefore 
allowed to remain unchanged within the protein. 

30 
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Thc net result of these processes was to identify those residues within the biyodin 1 
molecule which should be altered to eliminate potential MHC class II binding motife. 
For biyodin 1 protein, 13 potential epitopes were identified for elimination from the 
molecule. Individual amino acids within the predicted binding motifis were selected 
5 for alteration. With the object of maximising the likelihood of maintaining protein 
functional activity, in all cases conservative amino acid substitutions were chosen at 
any given site. 

An altered biyodin 1 sequence was compiled (Figure 10) and fiirther analysed by 
10 database comparison, as previously, for confiimation of successful elimination of 
potential MHC class n binding moti&. 

Method for construction of altered bryodin 1 molecules: 

A wild-type bryodin I gene was synthesised under contract by Genosys 

15 Biotechnologies Ud (Cambridge, UK). The gene was constructed by PCR using long 
(80-mer) overlapping synthetic primeis and the sequence as given by Gawlak et ai, 
(Gawlak, S. et al^ (1997) ibid,). The synthetic gene was cloned as a 843bp Ncol- 
EcdRl restriction fragment into a modified version of bacterial ejcpression vector 
pET22b+ (Novagcn, Madison, USA). The vector was modified to remove the pelB 

20 leader sequence which was previously shown to impede efficient expression of the 
biyodin I protein (Gawlak, S, et td., (1997) ibid). The modification was conducted by 
digestion with Xba\ and /\fco\ to remove lOTbp of DNA encompassing the pcIB leader 
sequence. Elements of non-pelB sequence including the ribosome binding site 
sequence and the A^col site, were restored in the vector by ligation of a linker molecule 

25 to the XbaVNcol free ends in the vector. The linker was formed by annealing 
complementary oligonucleotides: 

Llf (5*-Gtagaaaiaattttg;tttaactttaagaaggagatatacatatgcc) and 
Llr (5'-ccatggatatgtatatetccttcttaaagttaaacaaaattattt). 

30 
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Oligonucleotides were supplied with phosphoiylatcd ends by Gencsys 
Biotechnologies Ltd (Cambridge. UK). Restriction digests, DNA purification, ligation 
reactions ae. were conducted using standard procedures and conditions recommeaded 
by the regent suppliers. Altered (reduced immunogenicity) versions of the gene were 
5 engineered using site directed mutagenesis of the wild-type gene in the modified 
pET22b vector. Short (18-mer) synthetic oligonucleotides and the **quick-changc" 
procedure and reagents torn Stratagene (Cambridge, UK) were used to create the 
variant genes. All variant gene sequences were confirmed by DNA sequencing. 

10 Wild-type and mutated bryodin 1 gene variants were transfonned into £. coU strain 
TGI by standard techniques. A single transformed clone was selected for each geoe 
and this clone used for sequence analysis. For expression work, bryodin 1 genes were 
transformed into E. coli strain BL21(XDE3) obtained from the ATCC Recombinant 
bryodin 1 and variant bryodin 1 proteins were purified using methods described 

15 previously (Gawlak, S. et a/., (1997) ibid.). Following re-folding of the cmde protein 
fipom inclusion bodies, purified bryodin 1 and variants were obtained > 95% pure using 
CM-Scpharose chromatography as previously (Gawlak, S. et a/., (1997) ibid.). 
Activity of the recombinant proteins was assessed using a ccll-ftec protein synthesis 
inhibition assay and methods according to Siegall et al.^ (Siegall, C.B. et al., 

20 Bioconjugate Chenu 5 423-429 (1994)). 

Example 5 

In the present example the human imerferon a2 protein is analysed for the presence of 
potential MHC btndmg motifs and a method disclosed for the removal of a number of 
25 these from the molecule. 

Interferon alpha 2 (INA2) is an important glycoprotein cytokine expressed by 
activated macrophages. The protein has antiviral activity and stimulates the 
production of at least two enzymes; a protein kinase and an oligoadenylate synthetase. 
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on binding to the interferon alpha receptor in expressing cells. The mature INA2 
protein is single polypeptide of 165 amino acids produced by post-translational 
processing of a 188 amino acid pre<ujsor protein by cleavage of a 23 amino acid 
signal sequence from the amino teraiinus. 

5 

The protein has considerable clinical importance as a broad spectrum anti-viral, anti- 
proliferative and immunomodulating agent Recombinant and other preparations of 
INA2 have been used therapeutically in a variety of cancer and viral indications in 
man (reviewed in Sen, GG. and Lcngyel P, J. Biol. Chem. 267 5017-5020 (1992)). 

10 However despite veiy significant therapeutic benefit to large numbers of patients, 
resistance to therapy in certain patients has been documented and one important 
mechanism of resistance has been shown to be the development of neutralising 
antibodies detectable m the serum of treated patients (Quesada, JJL et aL, J, Clin. 
Oncology 31522-1528 (1985); Stein R.G. et al.. New Eng. J. Med. 318 1409-1413 

15 (1988); Russo. D. et al., Br. J. Haem. 94 300-305 (1996); Brooks M.G. et qL, Gut 30 
1116-1122 (1989)). An immune response in these patients is mounted to the 
therapeuUc interferon despite the fact that a molecule of at least identical primary 
stiucture is produced endogenously in man. In the present example, an interferon 
alpha 2 molecule with reduced potential immunogenicity in man is presented. 

20 

Method for identirieatioa of potential MHC class II binding motifs Id human 
interferon alpha 2: 

The sequence of INF2 was identified from the GcnBank database. The sequence with 
acccsaon number P01563 was used throughout The protein sequences of the mature 

25 INF2 protein was identified and the sequence excluding the first 23 amino acid signal 
peptide analysed for the presence of potential MHC class H binding motifs (Figure 
11). Analysis was conducted by computer using MPT verl.O software (Biovation, 
Aberdeen, UK). This software package conducts •'peptide threading" according to the 
methods disclosed in WO-A-9859244. The software is able to provide an index of 

30 potential peptide binding to 18 different MHC class H DR alleles covering greater than 
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%% of the HLA-DR.allotypes extant in the human population. 

Results of the '^peptide thzeading** process on the INF2, indicate the presence of a total 
of 18 individual potential epitopes. The q)itopes map to five distinct dusleis of 
5 ovcriapping epitopes encompassing residues 7^0 (cluster I), residues 45-70 (clusier 
2), residues 7^103 (cluster 3), residues 108-132 (cluster 4) and residues 140-163 
(cluster 5). Each of the five clusters contain 5, 3, 4, 3 and 3 potential epitopes 
reflectively. 

In order to prioritise epitopes for removal, the epitope clusters were then mapped to 
the known sinicture-fimction feanircs on the molecule. For INF2 evidence from 
homology modelling (Muigolo N.J. et ai.. Proteins: Structure, Function & Genetics 
17 62-74 (1993)). site directed mutagenesis (Tymms, M.J. et c/., Antiviral Res. 12 37- 
48 (1989); Mclnncs B. «r ai, J. interferon Res. 9 305-314 (1989)), cross-species 
chimacric molecules (Ra, N.B.K. e( al, J, Biol Chem, 2d3 8943-8952 (1988); 
Shaficrman. A. ei ai J. BioL Chenu 262 6227-6237 (1987)), dcleUon mutants (Wetzel, 
R. ei al., to: Interferons New York Academic Press, pp819-823 (1982)) and 
serological mapping studies (Lydon N.B. et ai. Biochemistry 24 4131-4141 (1985); 
Trotta P.P. et al.,. In: The Interferon System Dianzani F & Rossi 0.B (eds.) New York, 
Raven Press. 1985 pp23U235), suggest epitopes in chisters I and 2 are the highest 
priority targets for removal. With reference to the structural model of Murgolo et a! 
(Murgolo NJ. et al (1993) ibid,\ chistcr 1 featuring 5 potcnUal epitopes encompasses 
the functionally important helix A and the AB surface loop region of the INF2 
molecule. This region is important for the antiviral activity and is involved in binding 
to the human INF2 receptor strucnire. Most significantly, epitopes for neutralising 
antibodies have been mapped to this region, specifically residues 10-1 1 of the helix A 
strucmral element (Lydon KB. et al., (1985) ibid,). 

On this basis, cluster 1 epitopes al positions 7-19 and 13-25 were eliminated by 
substitution of threonine for leucine al position 15 (L15T using single letter codes). 



15 



20 
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This procedure was conducted interactively in silico using the MPTvcrl package. 
Similarly, cluster 1 q}itope at position 28-41 was eliminated by substitution F27Y. 
This latter substitution had (he concomitant effect of reducing the number of potential 
binding alleles for the overbipping epitope at position 22-34 fiom 1 3 to 1 1 . 



Cluster 2 epitopes extend fiom the AB loop through the helix B region and into the 
EC loop domain. Neutralising antibodies have been shown to bmd in the BC loop 
(Lydon N.B. et al., (1985) ibid.; Trotta P.P. et al.,, (1985) ibid ) therefore cluster 2 
epitope at position 58-70 was also targeted for removal. Substitution NdsL reduces 
10 potential immunogenicity in this region by eliminating 3 overlapping epitopes 
encompassing positions 61-77, which collectively are predicted to bind 5 diffcreni 
MHC DR alleles. Additionally, this substimtion also reduces the number of different 
binding alleles at epitope 58-70 fiom 5 to 3. 

15 Other epitope clusters (e.g. 3-5) map to cither buried regions of the molecule or 
surface regions not involved in receptor binding and hence provide antigenic sites to 
which antibody responses are largely non neutralising. 

20 Using the above method, an altered INF2 protein sequence was compiled containing 3 
substimtions fiom the starting sequence and is depicted m Figure 12. This sequence is 
predicted to be significantly less immunogenic with respect to human MHC class n 
presentation. The areas of reduced immunogenicity focussed to regions of the 
molecule where antibody mediated neutralisodon of the protein has been shown to 

25 occur with the potential to limit (he clinical efficacy of the molecule as a therapeutic 
entity. 

Method for coostnictioD of altered interferon alpha 2 molecule: 

A wild-type INA2 gene was synthcsised under contract by Genosys Biotechnologies 
30 Ud (Cambridge, UK). The gene was constructed by PCR using long (80-mcr) 
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overlapping synthetic DNA primers and the sequence given in GcnBank accession 
number M29883. The synthetic gene was cloned as a 520bp £coRI-//iBdin restriction 
fragment into bacterial expression vector pM£X8 (MoBiTec, Gotlingen, Germany). 
The gene sequence was confirmed to be identical to the desired gene using 
commercially available reagent systems and instmctions provided by the supplier 
(AmerBham, Little Chalfont, UK). 

Altered (reduced immunogenicity) versions of the protein were constnicied by site 
directed mutagenesis of the wild-type gene in pMEX8. Mutagenesis was conducted 
using short (l8-mer) synthetic oligonucleotides obtained commercially (Genosys, 
Cambridge, UK) and the "quick-change" procedure and reagents from Stiatagene 
(Cambridge, UK). Following site directed mutagenesis, DNA sequences of selected 
clones were coniimied as previously. 

15 For expression of recombinant wild-type and recombinant variant INA2 proteins, 
pMEX8-INA2 expression plasmids were transformed into £. coli strain JA221 and 
cells grown and harvested using standard procedures (Sambrook J„ Fritisch E.F. & 
Maniatis T. (1989) ibid). Recombinant INA2 was prepared essentially as described 
previously ((jrosfeld. H. er a/., in Advances in Biotechnological Processes 4, Mizrahi 

20 A. & Van Wczel A. L. eds.. pp59-78 Alan R, Liss Inc. New York (1985)) with minor 
modifications. Briefly, following high speed centrifugation, the supernatant was 
concentrated by 60% saturation of ammonium sulphate and then chromatographcd on 
a 2.7 X 68cm Sephadex G-75 column equilibrated with PBS plus 0,5M NaCl. 
Fractions (Sml) were collected at a ImiuMU flow rate. Pooled fi:actions were iimher 

25 purified using an immunoaffinity column as described previously ((siosfeld, H. er a/., 
(1985) ibid.). Purified proteins were assayed using SDS-PAGE analysis and 
functional activity (anti-viral activity) detemnined using biological assays as described 
previously (Shaffetman. A. et al, J. Biol Chem. 262 6227-6237 (1987)). 
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THE CLAIMS DEFININO THE INVENTION AM AS FOLLOWS: 

1. A method of rendering u endogeoous intFBCcUulsr proteiii. or pail of i protein, 

Tiftiv tmm unftggmc, or tess inmiunogcnlc, to a given species, the method oomprising: 
(ft) dctennniing ot least part of the amino &dd sequence of said endogenous 
S intracellulfirpiotem; 

(b) Ideotifyiag in die amino acid sequence one or more potentiAl epitopes for 
T cdU ceil epitopes'^ which arc found is said endogenous iotracellular protein; and 

(c) modi^g the amino edd sequence to elinunate at least one of the Teell 
q>itope8 identified in step (b) theteby to teduce the immunogenidty of the proton or 

10 part thereof when exposed to the immune system of the given species. 

Z A method as daimed in claim 1, wherein the given speciee is human. 

3. A method as claimed in claim 1 or 2. wtierein the T cell epitopes am identified 
15 by computation. 

4. A method as claimed in any of the claims 1 to 3, wherein a T cell epitope is 
diminated by alteration of one or more amino acids wiiAiin the epitope fitsdf. 

20 '5. A method as claimed in claim 4, wherein the alteration is an amino add 



6. A method as daimed in claim 5. wherein the amino acid substitoiion is made on 
the basis of similar size and/or charge. 

25 

7. A method as claimed in daim 4 or S, wherein the amino add suhstitulion is 
made with reference to in sitico protein moddUag techniqaes, 

8. A method substantially as hereinbefore desoibed with refereace to the exan^les 
30 and/or the preferred embodiments and exduding. if any. comparative examples. 

Dated this tenth day of August 2004 

BiovatioA Limited 

Patent Attoineys fiar the Applicant* 

3$ 

FBRICE&CO 



COUStDllKSBMM0Bfi4t70 ReeelMdlvlPAiidiala:TlRM{H3R}l75SOttt(y4»<2tX)44)8-10 



-36- 





wo 0004317 



PCT/GB99/D4tl9 



1/4 



A 

a-cbain 



ALYTLITPAVLRTDTEEQILVEAHGDSTPKQLD I FVHDFPRKQKTLFQTRVDMNPAGGMLVTP 

TIE1PAKEVSTDSR0NQYVWQVTGPQVRLEKWLLSYQSSFLPIC3TDKGIYTTOSPVLTO 

5f0>HllTSXMNKWXVEFQTPB6ILVSSNSVDLNFFWPYNLP0LVSL^ 

AYPDVRKYVLPSFEVRLQPSEKPFYlIXaiENFHVSITARYLyGEEVEGVJ^^ 

SI PDSLTRI PI IDGDGKATlJaU)TFRSRFPmJJELVGHTLYASVTVMTESGSDMVVTO 

IVASPYQIHFTKTPKYFKPGMPYELTVYVTNPDGSPAAHVPWSSAPHSMOTTLSDGTAKLIL 

NI PLNAQSLP ITVRTNHGDLPRBRQATKSMTAIAY QTQGGSGNYLHVAITSTBIKPGDKLPVN 

raVKGNANSLKQl KYFTYLILNKGKI FKVGRQPRRDGQNLVTMNLHITPDLI PSFRFVAYYQV 

GNNB I VADSVWVDVKDTCMGTLWKGDNLI QMPGAAMKI KLEGDPGARVGLVAVDKAVYVLND 

KYiaSQAKIWDTIEKSDFGCTAGSGQNNLGVFEDAGUaiTTSTXaj^ 



EIQMPTHKDLNLDITIELPDREVPIRYRINYENALLARTVETKLNQDI TVTASGDGKATMTI L 
TFYKAOWEKANVCMKFHUWSVENIHIJIAMGAKGAL>n,KICTR^^ 
FLPDAEDLTRLSKGVDRYISRYEVDMKMAQKVAVIIYLNXVSHSEDBCI^ 
PGSVK\nfSYYin-DEKCTKFyHPDKGTaLIJraCIGMVCRCAGETCSSUraQTO^ 
CSTlJVDYVYKTKUJlIEEQOGNDXYVHDVLSVIKQGTDENPRAKTHQyiSQRK 
DDYLIHGSRSDLLPTKDKISYIITKNTWIERWPHEDECX2EKEFQKLCDDFA0FSYTLTBFGCP 



DDNEDGFIADSDIISRSDFPKSWLWLTKDLTEEPNSQGISSKTMSFYU^SITTWVVLAVSFT 

PTKGICVAEPYEIRVMK\^FIDLWPYSVVKNEQVBIRAILHNYVNEDIYVRVELLYNPAFCS 

ASTKGQRYRQQFPimSSRAVPFVIVPLEQGIJOVEIKASVQEALWSDGVRKKLKVVPEGVQ 

KSIVTIV10LDPRA1C(5VGGTQLEV1KARKU)ORVPDTEIETKII1QG^ 

SIPO 



B 

p-chalo 



T 



c 



FIGURE 1 

Protein sequence of mature cobra venom factor 



A "alpha chain 
B = gamma diain 
C^betacham 
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ALYTLITPAVIATDTEEQILVEAHCaSTPKQIJJIFVHDFPRKQKTLTOTRVDmPJ^^ 

TIEI PAKEVSTDSRQNQYWVQVTGTQVRLEKVVLLS YQSSFLFIQTDKGIYTPGS PVLYRVP 

SKDIOTSIMNKTVIVEFQTPEGILVSSNSVDSIJJFFVPYNLPDLVSliGTWRIVAK^ 

TAYFDVJUCYVLPSFEVIVETOKFYYIIX3JENFHVS ITARYLyGEEVEGVAF\^FX^ DDAK 

ISLPBSLKRIPlIDGDGlCATLSW>TFRSRFPOTiNELVGKSLyVSAT\riTESGSDMVVTO 

HIVASPYOiHtTOTPKSfFKPGMPYBLTVFVTNPKSSPAAHVPVVVKAFHSMGTTLSDCT 

LNIPLNAOSLPITVRTtffiGDLPRERQATKSMTAIAyQTQGGSGKYUCVAITSTEIK^ 

NFNVKQIAMSIiAOIKYPTYLILNKGKIFKVGRQPREPGQDLVV^^ 

LIGASCaoTEIVADSVWVDNnCDSCVGSIVVKGDNLIQMPGAAMKIKIJXmPGARVGLV^ 

FVLNDKYKI SQAKI WDTVVEKADIGCTACSGQNOTjGVFEDAGIJU^TTSTNIOTKQRSJ^ 

PAN 



Protein seqaence of an altered cobra venom factor a-chain. 



DDNEDGFIADSDIISRSDFPKSWLWLTKDLTEEPNSQGISSKTMSFYLia)SITTWEVIAVSP^ 
PTKGICVADPFEVTVMKVFFIDLQMPySVVKNEQVBIRArLH^^nmEDLYVRVELLyNPA^^ 
ASTTGQRYRQQFPIKALSSRAWFVIVPLBQGUffiVEVKAAVYHHFISDGVRKKIJCWP 
KSTVTI VKLDPRAKGVGGTQLEVI KARKLDDRVPDTEIETKI I IQGDPVAQI lENSI DGSKLN 
SIPD 



Piotein setpience of an altered cobra venom factor y-^hain. 



EIQMPTHKDLinjDITIBLPDRBVPIRYRINYENASIJUlTVETKLNQDPTVTASGIX;^ 
TFYNAQWEKANVCNKFDUJVSVENIHLNAMGAKirrMIL^ 

PLPDAEDLTRLSKGVDRYISRyEVDNNMAQKVAVIIYIJDKVSHSEDDCIJIPKIIJaiFE^ 
PGSVKVYSyyiniDESCTRryHPDRGTGLLNKI CIGNLCRCAGBTCSSLNHQERVDVPLQI SKA 
CETNVDYVyKTIOJJlIEEQDaroEYVMDVLRVIKQGTDENQ 

DHYLIWGSRSDLLPTKDKISYIIGKDTWVEHWPEBDBCQEEEFOKLCDDFAQFSyTLTEPGCP 



FIGURE 2 



FIGURE 3 



T 
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lAGPEVOjUJRPSVlWSQLWSyAGTVECrrNQDISLKFrBIDLTSRPAHGGKTEQGLSPKS 

KPFATDSCStftPHIOJIKADUJCAIQEQLIANVHSNDDYPBVIDFASDATITDRNGKV^ 

raX3SVTLPTQPVQEFLLSGHVRVRPYKBKPIQ»QAKSVDVEYWQmLNP^ 

OTKLUCTIAICDTITSQEIAAQAQSILNKTHICVTIY 

FT!mVKNREOAYEINKKS«LNEEimm)LISEKYYVIJ^ 

IWNTNElJJCSBQLLTASEMMFIUJLYDPRDiaUCLLYNN^ 

TNRllTVnfKGKRPBGBNASYHIAYDimRYTBEERBVySYLRyTGTPIPDMPND^ 



lAGPmUJRPSVNNSQLWSVAGTVBGTMQDISLKPFEIDLTSRPAHGGKTEQGLSPKS 

KPFATDSGAMPHKLEKADLLXAKOEQLIANVHSNDDYFBVIDFASnATITDRNGKVYFAD 

KDGSVTLPTQPVQEFLLSGHVRVRPYKEKPIQNQAKSV0VEYTVQPTPLNPDDDFRPGLK 

DTKLLKTIAIGDTITSQELIAQAQSILNKTHPGYTIYERDSSIVTHDNDIFRTILPMDQE 

FTYHVKNREQAYEINKKSGlJJEEIlWrDLISEKYYVLKKGEKPYDPFDRSHLKLFTIKFV 

DVITOELLKSEQ1ATASERNLDFRDLYDPRDKAKLLYNNU)AFGIMDYTLTGI^^ 

TNRI ITVYMGKRPEGENASYHIAYDKDRYTEEEREVYSYIiRYTGTPIPDNPNDK 



Protein sequence of an altered streptokinase molecule. 

SSSFDKGKYKKGDI>A5YFEPTGPYIWW\rPGVDSKGNELLSPIftVHFPIKPG m 




Protein sequence of mature staphyloldnase. 



SSSFDKGKYKBGDDASQFBPTGPYLMVNVTGVDSAGNALLSPHYVEFP I KPTTLTEERI KYYV 

BMALDATAYAAFAVVBIiDPSARVBVTYYDKMaCKBETKSPPITBKGFVVPOT^ 

FTKWIBKK 




FIGURE 6 




FIGURE 7 
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DVSFRLSGATTTSYGVFIKNLRKALPYERICVYNI PLLRSSI SGSGRYTLIiHLTHYADETISVA 

VDVTK\nf IMGYIJ^DVSYFFNEASATEAAKFVFKDAKKKV^ P 

LGLPALDSAITIXYYYTASSAASALLVLIQSTAESARYKFIEQQIGKRVDXTFLPSXATISLE 

NMWSiU^KOIQIASTNMGQFBSPVVUDGNNQRVSITNASARVV^ 

tSFfTLIGFBHGLYGI 



Protein sequence of wild-type biyodin 1 . 



DVSFSMSGATTTSYGVFVKNLREALPFERKVYNI PLLRSS I SGSGRYTLLHLTNYADETI SVA 

VDVTNVYIMGYLAGDVSyFFNEASATEAAKFVFKDAKKKVTLPYSGra P 

LGLPALDSAITTLYYYTASSAASALLVLIQSTAESARFKFIEQQIGKRVDXTFLPSLATISLE 

NNHSALSKQIQIASTNNGQFESPVVL>nX3NNQSVSITKASARVVT 

IShTTLIGFEHGLYGI 



Protein sequence of an altered biyodin 1 molecule. 



CDLPQTHSI^Smi^LLAQMRKISLFSCLKDRHDFGFPQEEPGNQFQKAETIPVLHEHIQQI 

FNLFSTKDSSAAWDETLLDKFYTELYCMUroLEACVIQGVGVTBTPUflKEDS 

TLYLKEKKYSPCAWEWRAEIMRSFSLSTHLQESliRSKE 



Protem sequence of mature human interfexon alpha 2 



CDLPCTTHSIiGSRRITKLIAQMRKISLFYCUa^RHDFGFPQEEFGKQFOKASTIPVI^^ 
FLLPSTKDSSAAWDETLLDKFYTELYQQLNDLEACVIQGVGVTETPLMKEDS IIAVRKYFQRI 
TLYLKEKKYSPCAWEWRABIMRSFSLSTNLQESLRSXB 



FIGURE 9 



FIGURE 10 



FIGURE 11 



FIGURE 12 

Protein sequence of an altered human interferon alpha 2 molecule 
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